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ABSTRACT: Glycosylation is one of the most common 
protein modifications. Each glycoprotein can be glycosylated at 
multiple glycosites, and each glycosites can be modified by 
different glycans. Due to this heterogeneity of glycosylation, it 
has proven difficult to study the structure—function relation- 
ship of specific glycans and their affected glycoproteins. Here, 
we report a novel method for rapid and quantitative 
identification of glycoproteins containing specific glycans. 
Lectin affinity isolations are followed by chemical immobiliza- 
tion of the captured glycopeptides, allowing the identification 
of glycoproteins containing specific glycans by subsequent 
mass spectrometry. The application of the method should be 
useful to facilitate our understanding of how changes in glycan 
associate with diseases, and to discover novel glycoproteins 
with certain glycans that could serve as biomarkers or 
therapeutic targets. 


lycosylation of proteins is known to play an important 

role in various biological processes, and aberrant 
glycosylation also plays an important role in a large number 
of human diseases.'~* Since glycoproteins typically contain 
variable glycan structures at each glycosylation site (glyco- 
site),”° the glycans present at each glycosite on the 
glycoproteins have therefore attracted great interest, and 
elucidation of their precise nature would greatly advance our 
understanding of their role in health and disease. However, 
there are currently no methods available that are sufficiently 
specific and sensitive for the analysis of glycans at each glycosite 
in complex samples. 

Recently, two popular glycoprotein enrichment methods 
have been established: affinity capture based on glycan-lectin 
interaction and solid-phase extraction of glycopeptides (SPEG). 
For example, Kaji et al. isolated N-linked glycoproteins and 
glycopeptides using a lectin affinity capture method, which was 
followed by tandem mass spectrometry (MS/MS).’ Several 
studies reported the use of lectins for the characterization of 
glycoproteins in serum and clinical samples.*—'* However, the 
capture specificity of glycopeptides, which is defined as the 
percentage of identified peptides containing consensus N- 
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linked glycosylation motif (NXT/S sequence, while X is any 
amino acids except P), is insufficient. Presumably, the low 
specificity of this method is caused mostly by binding of 
nonglycopeptides within proteins to lectins due to the low 
affinity of the lectin—glycan interaction. 

Chemical immobilization methods, such as solid-phase 
extraction of glycosite-containing peptides (SPEG), allows for 
specific isolation and identification of glycopeptides.'*'* 
Glycans of the cis-diol groups are oxidized and covalently 
coupled onto a solid surface using hydrazide chemistry. The 
nonglycopeptides that are not coupled are removed, and the 
glycopeptides are subsequently released using endoglycosidase 
PNGase F prior to analysis using MS/MS. It has been 
demonstrated that by using the SPEG method, the glycopep- 
tides could be captured with high specificity, and then identified 
in a rapid, sensitive, and reproducible manner.'*~'? However, 
the SPEG method cannot provide any structural information 
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about glycans, since the glycans are removed from the peptides 
before MS detection. 

Here, we developed a novel method that allows for 
identification of N-linked glycosites using tandem mass 
spectrometry using lectin affinity capture followed by SPEG 
(LecSPEG). For proof-of-principle, we applied this method to 
compare the glycosite-containing peptides in the serum sample. 
We showed that this method was able to identify and quantify 
those glycopeptides from glycoproteins interacting with specific 
lectins with high specificity and sensitivity. 


E EXPERIMENTAL PROCEDURES 


Materials and Reagents. Aleuria aurantia lectin (AAL)-, 
Sambucus nigra lectin (SNA)-, and wheat germ agglutinin 
(WGA)-conjugated agarose beads were purchased from Vector 
Laboratories (Burlingame, CA). Tris (2-carboxythyl) phos- 
phine (TCEP) was purchased from Pierce (Rockford, IL). 
Sequencing-grade trypsin was from Promega (Madison, WI). 
Peptide-N-glycosidase F (PNGase) was from ProZyme (San 
Leandro, CA). Sodium periodate, hydrazide resin, and P6 
desalting spin columns were from Bio-Rad (Hercules, CA). 
C18 and MCX desalting columns were from Waters (Milford, 
MS). The high-performance liquid chromatography—mass 
spectrometry (HPLC—MS) platform used in this study 
includes an Eksigent 2D nanoLC system (Dublin, CA) and a 
Thermo Scientific Orbitrap Velos mass spectrometer (Wal- 
tham, MA). Magic C-18 packing material was from Michrom 
Bioresources (Auburn, CA), and all other chemicals were 
purchased from Sigma (St. Louis, MO). 

Serum Samples. The serum specimens and relevant clinical 
information were obtained with informed consent and with the 
approval of the Institutional Review Board of the Johns 
Hopkins University. To establish the method, pooled serum 
samples from healthy males were used. 

Glycoprotein Extraction Using Affinity Capture by 
Lectin-Conjugated Agarose Beads (Lec Meothd). Lectin- 
conjugated agarose beads (250 uL) were prewashed for three 
times with 1 mL Tris-HCl buffer (0.2 M Tris-HCl, pH 7.5). 
The beads were mixed with Tris-HCI buffer for 5—10 min, and 
the buffer was removed following centrifugation at 6000g for 5 
min. Aliquot of serum pooled (25 uL) was diluted 1:10 in Tris- 
HCI buffer then incubated with the beads at 4 °C overnight 
with gentle shaking. The unbound nonglycoproteins were 
removed by washing the beads with Tris-HCl buffer for three 
times. Beads were then incubated with 100 yL lectin-specific 
elution buffers (100 mM fucose in Tris-HCl buffer for AAL, 0.5 
mM lactose in Tris-HCl buffer for SNA, and 0.8 M N-acetyl-p- 
glucosamine in 0.2 M acetic acid for WGA) for 30 min with 
gentle shaking to elute glycoproteins. The supernatant was 
collected and desalted on a P6 column. Glycoproteins were 
denatured, reduced, and digested with trypsin as previously 
described'** Trypsin was then deactivated by incubating the 
sample at 80 °C for 15 min and 0.5 uL of PNGase F was used 
to remove N-linked glycans from the peptides. The peptides 
were desalted using a C-18 column, and the eluate was dried 
using a SpeedVac and resuspended in 10 uL of 0.4% acetic acid 
per sample. 

Glycopeptide Extraction Using Affinity Capture by 
Lectin-Conjugated Agarose Beads (Lect+Lec Method). 
Glycoproteins were extracted from a 25 mL aliquot of serum 
pooled from healthy men using the Lec method as described 
above. The eluted glycoproteins were digested with trypsin. 
The peptides were then desalted with a C-18 column and 
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incubated with 100 „L of prewashed lectin-conjugated agarose 
beads for glycopeptide enrichment using the same procedures 
for glycoprotein extraction described above. The eluted 
glycopeptides were desalted using a C-18 column, and the 
glycopeptides were treated with 0.5 yL of PNGase F to release 
N-linked glycans from the peptides. The peptides were desalted 
again and dried using a SpeedVac, before being resuspended in 
10 uL of 0.4% acetic acid per sample. 

Extraction of N-Linked Glycosite-Containing Peptides 
Using the Combination of Lectin Affinity Capture and 
SPEG Method (LecSPEG Method). The peptides enriched by 
lectins were oxidized with 10 mM sodium periodate at 4 °C for 
1 h in the dark. The oxidized peptides were purified using C18 
desalting columns to remove the oxidant and conjugated to 
hydrazide beads at room temperature for 4 h in 80% 
acetonitrile. Nonconjugated peptides were removed by washing 
the resin for three times with 800 uL washing buffer (1.5 M 
NaCl, H,O, and 100 mM of NH,HCO,). The N-linked 
glycosite-containing peptides were released from the resin with 
0.5 uL of PNGase F in 100 mM of NH,HCO, and incubated at 
37 °C overnight. After the final purification using a MCX 
desalting column, the peptides were dried by SpeedVac and 
resuspended in 10 uL of 0.4% acetic acid solution for 
subsequent mass spectrometry analysis. 

Analysis of N-Linked Glycosite-Containing Peptides 
Using a HPLC-Orbitrap-MS Platform. Peptide identification 
by liquid chromatography tandem mass spectrometry (LC— 
MS/MS) analysis was performed using an Orbitrap MS/MS 
(Thermo Fisher Scientific) interfaced with a nanoLC system 
(Eksigent). A 5 uL sample of isolated peptides was loaded onto 
the HPLC-Orbitrap-MS platform. The HPLC mobile phase A 
were composed of 0.2% formic acid in HPLC-grade water, 
while phase B were composed of 0.2% formic acid in HPLC- 
grade acetonitrile. The mobile phase B was increased linearly 
from 5% to 40% over 90 min at 300 nL/min. A C18 column 
(75 um ID x 10 cm, Magic C18 material, 5 um, 100 A) was 
used for peptide separation. Mass analysis was performed using 
a data-dependent analysis of the top ten precursors and a 
dynamic exclusion of 30 s. The data was acquired at 30000 
resolution for the precursor scans and 7500 for MS/MS. Target 
values of le6 for MS and le5 for MS/MS were set with 
maximum injection times of 100 and 300 ms, respectively. Data 
was acquired used the monoisotopic precursor selection 
(MIPS) method, with predictive AGC enabled. 

The acquired data was searched against the Homo sapiens 
taxonomy of the RefSeq database (version 40, 04/16/2010) 
using the Mascot (Matrix Science version 2.2.0) search 
algorithm within Proteome Discoverer (Thermo Scientific 
version 1.1). The data was searched with 2 missed cleavages 
allowed and a tolerance of 15 ppm on the precursors and 0.02 
Da on the fragment ions. Modifications allowed included 
carbamidomethyl cysteines set to static and deamidation of 
asparagines and oxidation of methionines set to variable. Data 
were searched against a decoy database and filtered to a 1% 
false discovery rate (FDR). The data set was further filtered to 
ensure that all the identified peptides had Mascot ion scores 
greater than 50. 


E RESULTS 


Detection of glycosite-containing peptides with special glycan 
motifs using lectin affinity capture and chemical immobilization. 

To compare the LecSPEG method with the two lectin 
enrichment methods, glycoprotein enrichment (Lec method) 
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Figure 1. Schematic diagram of the analytical procedure of glycosite detection using a lectin affinity capture method combined with chemical 


immobilization. 
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Figure 2. Reproducibility of glycopeptides captured by LecSPEG method using SNA conjugated agarose beads analyzed in quadruplicate. 
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and affinity enrichment of glycoproteins followed by enrich- 
ment of glycopeptides from the enriched glycoproteins using 
lectins (Lec + Lec method), we applied these three methods to 
aliquots from the same pooled sample of human sera to 
determine the number of identified peptides (Figure 1), the 
number of identified glycopeptides, and the specificity of 
identified glycopeptides to total peptides. 

First, we employed the lectin SNA-conjugated agarose beads 
to capture the glycoproteins by using the Lec method. A total 
number of 85 peptides were identified from the serum sample, 
with only 2 peptides containing N-linked glycosylation sites 
with NXT/S motif. The low capture specificity (2%) of 
glycosite-containing peptide identified by this method suggests 
that it is not suitable for glycosite analysis (Table 1 of the 
Supporting Information). 

Next, lectin SNA-conjugated agarose beads were first used to 
capture glycoproteins from the serum sample, and the isolated 
glycoproteins were digested to peptides. The glycopeptides 
were then further isolated by lectin SNA-conjugated agarose 
beads and analyzed by LC-MS/MS. Using this tandem lectin 
affinity capture method, 14 glycosite-containing peptides were 
identified from the same serum sample with the percentage of 
specificity or identification of glycosite-containing peptides at 
36%. Thus, the Lec + Lec procedure increased the number of 
identified glycosite-containing peptides and capture specificity 
when compared with the Lec method (Table 1 of the 
Supporting Information). 

Finally, LecSPEG was used to analyze glycoproteins from the 
same serum sample, after isolation of glycopeptides by Lec + 
Lec method. They were captured by SPEG method and 
analyzed by LC-MS/MS. We were able to identify 46 unique 
glycosite-containing peptides from the serum sample with the 
capture specificity of 82% (Table 1 of the Supporting 
Information). Clearly the lecSPEG method has the power to 
identify the most glycosites and has the highest specificity for 
the capture in crude serum among the three methods tested. 

To determine if the LecSPEG method possesses a principal 
advantage for glycopeptide detection with specific glycan 
motifs, we compared the three methods using SNA, AAL, 
and WGA-conjugated beads for glycopeptide identification. 
SNA binds to glycans containing a-2,6-linked sialic acid; AAL 
binds to glycans containing fucose; and WGA binds to glycans 
containing terminal /-N-acetylglucosamine or sialic acid.”° In 
each case, more glycopeptides were identified with higher 
capture specificity using the LecSPEG method (Table 1 of the 
Supporting Information). 

Furthermore, we tested the reproducibility of the LecSPEG 
method for glycopeptide analysis. The glycopeptides were 
isolated in quadruplicates from 25 pL serum samples by 
LecSPEG method using SNA-conjugated agarose beads. Using 
this hybrid method, 42 glycopeptides were identified in all four 
replicate samples (Figure 2; Table 2 of the Supporting 
Information). The CVs of the peak intensities of the 42 
glycopeptides ranged from 2.1% to 39.1%. Only four 
glycopeptides displayed CVs greater than 30%, and the majority 
of glycopeptides (38 of 42, 90.5%) had CVs lower than 30%, 
which indicates that this multistep method was able to generate 
reproducible results. 


© DISCUSSION 

The interactions between lectins and glycans are relatively 
weak, and a large proportion of peptides captured by lectins are 
nonglycopeptides. When we sequentially combined lectin 
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affinity capture with the SPEG method, the capture specificity 
of glycopeptides was significantly improved from 2% (Lec) to 
36% (Lec+Lec), to further 82% (LecSPEG), while the number 
of identified unique glycosites increased from 2 (Lec) to 14 
(Lect+Lec), to 46 (LecSPEG) using SNA-conjugated agarose 
beads. Similar results were also observed when using more 
lectins for glycopeptide capture, including AAL and WGA 
(Table 1 of the Supporting Information). 

In addition to its high specificity and sensitivity, the 
reproducibility of the method for glycopeptide analysis is 
important, especially for clinical research, which usually requires 
sample comparison. For the tandem SNA lectin and SPEG 
capture methods, the CVs of most identified glycopeptides 
were lower than 30%, which is comparable to those of tryptic 
peptides analyzed by LC—MS.'??"* Thus, our data indicate 
that our system is feasible for quantification analysis of clinical 
samples. 

Moreover, the glycan motif information can be obtained for 
each glycosite-containing peptide using LecSPEG detection. 
The glycosite-containing peptides identified by SNA-LecSPEG 
method contained sialic acids (a2—a6) on glycans. The 
glycopeptides identified by AAL-LecSPEG and WGA-LecSPEG 
contained fucose and terminal /-N-acetylglucosamine or sialic 
acids, respectively. When the serum samples were profiled using 
the LecSPEG method, the glycan motif information and 
quantification data based on the LC—MS peak intensity clearly 
showed the heterogeneity of glycans at each glycosite, that 
individual glycoproteins could contain multiple glycosites and 
each glycosite has its own glycan motif patterns (Table 3 of the 
Supporting Information). For example, the glycosite 
DFVnASSK of Heparin cofactor II protein contains glycan 
motifs of fucose and terminal /-N-acetylglucosamine that 
bound to lectins AAL and WGA, respectively (Table 3 of the 
Supporting Information). We can determine the relative 
quantities of the specific glycan motif at each glycopeptides 
from individual samples by directly comparing the LC-MS 
data between different samples. 

In summary, we developed a lectin affinity followed by SPEG 
capture method for glycopeptide analysis and tested the 
analytical performance of the LecSPEG in this study. We 
demonstrated that our method can be used to profile 
glycopeptides with specific glycan motifs. The newly developed 
LecSPEG method will be a useful tool for subsequent 
glycoproteomic studies in both biological and clinical studies. 
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